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Abstract 



Expansion of the neocortex is a hallmark of human evolution. However, it remains an 

P^ open question what adaptive mechanisms facilitated its expansion. Here we show, using 

• gyrencephaly index (GI) and other physiological and life-history data for 102 mammalian 

"q species, that gyrencephaly is an ancestral mammalian trait. We provide evidence that the 

1 

Q^ evolution of a highly folded neocortex, as observed in humans, requires the traversal of 

a threshold of ~10^ neurons, and that species above and below the threshold exhibit a 

t> bimodal distribution of physiological and life-history traits, establishing two phenotypic 

1-H groups. We identify, using discrete mathematical models, a basal radial glia-derived transit- 

l/-^ amplifying progenitor as evolutionarily necessary and sufficient for generating a fourteen- 

■^ fold increase in daily prenatal neuron production and thus traversal of the neuronal thresh- 

m old. We demonstrate that length of neurogenic period, rather than any novel progenitor- 

sj type, is sufficient to distinguish cortical neuron number between species within the same 

k>( phenotypic group. 



Introduction 

Development of the human neocortex involves a lineage of neural stem and progenitor cells that 
forms a proliferative region along the ventricular epithelium. The proliferation of cells within 
this region expands the neocortex by increasing neuron number. At the onset of mammalian 
cortical neurogenesis, neuroepithelial cells transform into radially oriented apical radial glia 
(aRG), which proliferate extensively at the apical surface of the ventricular zone and divide 
asymmetrically to self-renew and generate a neuron, intermediate progenitor (IP), or basal ra- 
dial glia (bRG) [IJ. IP cells delaminate from the apical surface and translocate their nucleus to 
the basal region of the ventricular zone (VZ) to form a second germinal layer, the subventric- 
ular zone (SVZ), where they divide symmetrically to generate two neurons f2^4\. Similarly to 
aRG cells at the ventricular surface, bRG cells, which maintain a single fiber ascending only to 
the basal surface, divide asymmetrically [EHHl; but contrary to aRG cells, bRG in the human 
generate neurons via transit-amplifying progenitors (TAPs), a cell-type that is not observed to 
originate basally in the mouse OH. The abventricular expansion of progenitors during corti- 
cal neurogenesis in humans further compartmentalizes the basal region into an inner and outer 
SVZ, driving the radial fibers to have divergent, rather than parallel, trajectories to the cortical 
plate, and thus creating the folded cortical pattern observed in gyrencephalic species through 
the tangential expansion of migrating neurons iHITOl. For this reason, and based on supporting 
evidence obtained in the gyrencephalic human and ferret and lissencephalic mouse, it was origi- 
nally thought that an abundance of bRG cells in the outer SVZ was an evolutionary determinant 
for establishing a relatively large and gyrencephalic neocortex [J5l |6l [HI. But recent work in 
the lissencephalic marmoset {Callithrix jacchus) has shown that bRG cells may, in fact, exist 
in comparable abundance in both gyrencephalic and lissencephalic species [|T2l [T3l and so can- 
not alone be sufficient for either establishing or increasing cortical gyrification. Thus, despite 
considerable progress in the study of brain size evolution [[T4l - [T6l . the adaptive mechanism that 
has evolved along certain mammalian lineages to produce a large and folded neocortex is not 
known. 

In this study, we analyzed physiological and life-history data from 102 mammalian species 
(Table SI; Table S2; External Database 1). We show that a gyrencephalic neocortex is ancestral 
to all mammals (Fig. 1) and that GI (Fig. SI), like brain size, has increased and decreased 
along many mammalian lineages. These changes may be reliably characterized by convergent 
adaptations into two distinct physiological and life-history programs (Fig. 2a), resulting in a 
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Figure 1: Ancestral reconstruction of Gl values for 102 mammalian species. 

Gl values were determined as illustrated in Fig. SI for the species listed in Table SI. 
Reconstructed Gl values for putative ancestors are presented at selected internal nodes 
of the phylogenetic tree. MYA, million years ago; colors indicate taxonomic groups. 
Images of Nissl-stained coronal sections of representative species for each taxonomic 
group, downloaded from http://brainmuseum.org, along with respective Gl values, are 
shown on the right. 

bimodal distribution of mammalian species (Fig. 2b) with a robust threshold value for both Gl 
and neuron number (Fig. 3). Traversal of the threshold requires greater neuron production per 
gestation day (Fig. 4a,b), which we argue is necessitated by the evolution of bRG-derived TAPs 
during cortical neurogenesis (Fig. 5). 

The mammalian ancestor was gyrencephalic 



We tested multiple evolutionary models for Gl evolution. The model that conferred most power 
to explain the Gl values across the phylogeny while making the fewest assumptions about 
the data (i.e., had the lowest Akaike Information Criterion (AIC)) showed a disproportionate 
amount of evolutionary change to have occurred recently, rather than ancestrally, in mammals 
(Fig. S2) and diverged significantly from a null model of stochastic evolution IITtI . We identi- 
fied a folded neocortex (Gl =1.37 ± 0.16 s.e.m.) as an ancestral mammalian trait (Fig. 1). It 
is apparent from ancestral and other internal node reconstructions (Fig. S3a,b) that Gl is very 



variable, but also that reductions in the rate at which GI evolves have favored branches leading 
to decreases in GI (e.g., strepsirrhines and insectivores) and accelerations in that rate have fa- 
vored branches leading to increases in GI (e.g., carnivores and caviomorphs). A simulation of 
the average number of total evolutionary transitions between GI values evidences more affin- 
ity for transitioning from high-to-low than low-to-high GI values: the majority of high-to-low 
transitions (58.3%) occurred in species with a GI < 1.47; and the fewest transitions (16.7%) 
occurred across a threshold value of 1.5 (Fig. S4). This indicates that, although there is an 
evident trend in mammalian history to become increasingly gyrencephalic, the most variability 
in GI evolution has been concentrated among species below a certain threshold value (GI = 
1.5). We therefore present a picture of early mammalian history, contrary to those previously 
painted, but which is gathering evidence through novel approaches [fT8l[T9ll . that the Jurassic-era 
mammalian ancestor may, indeed, have been a large-brained species with a folded neocortex. 

A threshold in cortical neuron number 

The evolutionary effects of a folded neocortex on the behavior and biology of a species is not 
immediately clear. We therefore analyzed associations, across the phylogeny, of GI with dis- 
crete character states of 37 physiological and life-history traits (Table S2). Distinct sets of small 
but significant (R^ < 0.23, P < 0.03) associations were found for species above and below a GI 
value of 1.5, indicating that these two groups of species adapt to their environments differently 
(Fig. 2a). Both groups were sampled from across the phylogeny, showing no phylogenetic 
signal. Clustering analyses also supported a bimodal distribution above and below a threshold 
value of 1.5 (Fig. 2b; Fig. S5). To test the bimodal distribution explicitly, we regressed GI val- 
ues against neuroanatomical traits and found that each scaling relationship could be explained 
comparably well by either a non-linear function (Fig. 3a) or two grade-shifted linear functions, 
with the best-fit linear models drawing significantly different slopes (P = 3.4 x 10^^) for high- 
er > 1.5) and low-GI ( < 1.5) species. (Fig. 3b,c). By plotting GI as a function of cortical 
neuron number, we were able to demarcate, with two significantly different linear regressions 
for high- and low-GI species (T = 4.611, d.f. = 29, P = 2.8 x 10^"*), a cortical no-mans-land 
centered on an area approximating 1 ±0.11 x 10^ neurons and 1.56 ± 0.06 GI (Fig. 3d). The 
deviation of these results from previous work, which have shown strong phylogenetic signals 
associated with both GI [|20l |2T1| and neuron counts [|22l . may be explained by our more than 
2-fold increase in sampled species. Variation in GI, therefore, has not evolved linearly across 
the phylogeny, but has in fact been differentially evolved in two phenotypic groups. Each group 
may be characterized not only by a high (> 1.5) or low (< 1.5) GI value, but also by a distinct 
constellation of other physiological and life-history traits which have accompanied each group 
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Figure 2: Clustering of Gl values based on life-history association analysis (a) 
and minimum-energy distance (b).(a) Stochastic mapping of physiological and life- 
history traits with Gl values for the 102 mammalian species listed in Table SI. Gl values 
were separated into four groups based on clustering. Forty traits, each comprising 3 
- 6 character states, were analyzed (see Table S2 for a complete list), and the states 
showing a significant positive (P, green) or negative (N, red) association with a group 
of Gl values are shown. Note the major overlap between the two low-GI groups (10/27) 
and between the two high-GI groups (9/24), whereas only 3/48 character states are 
shared between Gl groups < 1.5 versus > 1.5. (b) Hierarchical clustering based on 
minimum-energy distance of the Gl values for 101 mammalian species. Note that the 
greatest clustering height is between species with Gl values of < 1.5 and > 1.5. 



over evolutionary time. 

More efficient neurogenesis in large-brained species 

By establishing an evolutionary threshold based on both degree of gyrencephaly and neuron 
number, we identified two neurogenic phenotypic groups, which found support in their distinct 
life-history associations (see previous section). These groups could be further divorced by 
accounting for the amount of brain weight accumulated per gestation day confident proxies 
for neonate brain weight and neurogenic period, respectively (Fig. S6) which we show to 
be, on average, 14-times greater in high- compared to low-GI species (Fig. 4). Notably, each 
GI group is constituted by both altricial and precocial species, so the degree of pre- versus 
post-natal development is not enough to explain the discrepancy in brain weight per gestation 
day in each group. Rather, to explain the discrepancy, we introduced a deterministic model 
of cortical neurogenesis, using series summarizing five neurogenic lineages and based on cell- 
cycle length, neuroepithelial founder pool size, neurogenic period, and estimates of relative 
progenitor-type population sizes (Table 1). We arrived at two models, based on the analysis of 
16 species, that show the highest reliability for predicting cortical neuron numbers in a range 
of species: a mouse model, which implicates only aRG, IP, and bRG; and a human model, 
which additionally implicates proliferating bRG-derived TAPs. Each model is defined by the 
proportional occurrence of each lineage in that model (Table 2). Using the mouse model, with 
varying proportional occurrences of each lineage, we were able to predict neuron counts within 
2% of the observed counts for low-GI species, but underestimated neuron counts by more than 
80% in high-GI species (Fig. 5; Table S3). Similarly, the human model predicted neuron counts 
within 5% for high-GI species, but overestimated neuron counts by more than 150% for low- 
GI species. Increased proportional occurrences of the bRG lineage with increasing brain size 
was required to achieve estimates with < 5% deviation from observed neuron counts in low-GI 
species (Table 2; Fig. S7). Estimates of proportional occurrences in the mouse, marmoset, and 
rabbit are supported by previous work detailing relative abundances of different progenitor cell- 
types during cortical neurogenesis [HI [131, [IK and WBH, in preparation]. Evolutionary gain 
or loss of bRG-derived TAPs is an essential mechanistic determinant of neocortical expansion, 
such that its presence in high-GI species and absence in low-GI species is sufficient and even 
requisite for explaining neocortical evolution (Fig. S8). 
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Figure 3: Ln-transformed plots showing Gl values as a function of brain weight (a, 
b, 101 species), neocortical volume (c, 29 species) and cortical neuron number 
(d, 22 species), (a) Regression analysis using one non-linear fit for all values (y = 
0.018x2 -K 0.037X -h 0.014, R^ = 0.612, P = 6 x 10"^); (b-d) regression analyses using 
two different linear functions (b, blue line: y = 0.075x - 0.481, R^ = 0.56, P = 4 x 
10"^ red line: y = 0.245x + 0.018, R^ = 0.73, P = 1 x 10"^; c, blue line: y = 0.050x 
- 0.194, R2 = 0.21, P = 0.017, red line: y = 0.154x - 1.09, ^= 0.82, P = 0.004; d, 
blue line: y = 0.072x - 1.188, R^ = 0.81, P = 1 x 10"^ red line: y = 0.140x - 2.370, 
R2 = 0.98, P = 3 X 10^^) for species with Gl values of < 1.5 (blue triangles) and > 
1.5 (red circles), respectively; mouse and human are indicated by green symbols. The 
inset in (b) shows the AlC values for models fitted with 1-5 linear slopes; note that a 
two-slope model best explains the data. See Table SI for data. 
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Figure 4: Brain weight per gestation day is considerably greater for high- versus 
low-GI species, (a) Ln-transformed density plot of brain weight per gestation day for 
96 eutherian species listed in Table SI with Gl values of < 1.5 (blue) and > 1.5 (red). 
Note the significantly different means for the two groups (dashed blue and red lines, T 
= 5.16, d.f. = 41, P = 4 X 10^^). (b) Ln-transformed plot of brain weight per gestation 
day for 96 mammalian species (see a). Dashed blue line, mean value for Gl < 1.5 (2.04 
± 0.047, s.d.); red dashed line, mean value for Gl > 1.5 (0.583 ± 0.050, s.d.). The 
colors in the index refer to species in Fig. 1. See Table SI for data. 



Adaptive evolution of a proliferative progenitor cell-type 

To simulate the adaptiveness of evolving TAPs in two lissencephalic species - mouse and mar- 
moset - we calculated trade-offs between neuroepithelial founder pool size and neurogenic pe- 
riod using mouse/marmoset and human models of cortical neurogenesis to achieve one billion 
neurons. We show that, in both species, evolving a lineage of bRG-derived TAPs is between 
2- and 21 -times more cost-efficient than either expanding founder pool size or lengthening 
neurogenesis; and that the marmoset, by evolving bRG-derived TAPs, could in fact reduce its 
observed founder pool size or neurogenic period to achieve one billion neurons (Fig. S9). We 
further clarified the significance of each progenitor-type with deterministic and stochastic mod- 
els of temporal dynamics and progenitor cell-type variables. From these we conclude that basal 
progenitors are increasingly necessary in larger brains and that achieving 10^ neurons is sta- 
tistically implausible in the absence of bRG-derived TAPs (Table S4). Finally, we described 
the dynamics of TAPs, isolated from their observed lineage beginning at the apical surface, 
by introducing three ordinary differential equations (ODEs) modeling a self -renewing cell that 
generates either a differentiated cell or proliferative cell. We show that neuronal output of the 
system increases dramatically when both the initial pool of self-renewing cells and the likeli- 
hood of those initial cells to generate proliferative, rather than differentiated, cells approaches 
saturation (Fig. SIO). 

Discussion 

The emergence of new structures, in the most general sense, is typically limited to selection 
on existing developmental processes; and conserved pathways may persist, over evolutionary 
time, even when the phenotype is transformed or unexpressed [|23l - l25ll . However, it is also evi- 
dent that development may be adapted without affecting phenotype (e.g., [|26ll27ll ). Therefore, 
in order to understand selective pressures acting on a discontinuous or convergent trait, it is 
necessary to investigate the underlying developmental processes generating it. We have shown 
that a gyrencephalic neocortex is ancestral to mammals, which is concordant with evidence 
[fT9l that the mammalian ancestor was large ( > 1kg) and long-lived ( > 25-year lifespan) and, 
furthermore, provides considerable resolution to recent evidence for a gyrencephalic eutherian 
ancestor [|T8ll by sampling nearly twice as many species and categorizing gyrencephaly as a 
continuous, rather than a binary, trait. More surprisingly, we show that convergent evolution of 
higher-orders of gyrencephaly along divergent lineages has been accompanied by two distinct 
constellations of physiological and life-history paradigms. Specifically, species with a GI > 1.5, 
which is commensurate with one billion cortical neurons, exhibit patterns of development and 



®Z"/ ®12n: ®Z(f 



i)n: 



®i(n,-2+f)n, ®t{2n:-5+f)n,*c 




^ apical RG 
^ neuron 

QiPC 

basal RG 

Qtap 



self-renewing 




24- 












Human 9 


23- 












Gorilla Q 
Orangutan 


22 










Rhesus © 
Baboon ^ 


• 


21 
w 

i 20- 






▲ 


▲ 


Capuchin ® 

• 


% 


19- 




▲ 






• 
O 




18- 




▲ 


A Rabbit 


.Tree shrew 






17 




^Rat 
A Mouse 










16 


^,— ^ 













2.5 3 3.5 

In neurogenic period (days) 



4.5 



Figure 5: Distinct combinations of progenitor lineages are required to predict 
neocortical neuron numbers for low- versus high-GI species, (a) Schematic of the 
five progenitor lineages used in various combinations to predict neuron numbers. Series 
above the schematics summarize the neuronal output of each lineage, where «,• is the 
number of / divisions. A constant, c = 0.989, is incorporated into the series for lineage 
5, allowing the series to converge on the true value of the lineage output as the number 
of divisions becomes increasingly numerous, (b) Ln-transformed plot of observed neuron 
counts as a function of neurogenic period for 4 species with a Gl < 1.5 (open blue 
triangles) and 6 species with a Gl > 1.5 (open red circles). Predicted neuron counts 
were calculated using combinations of the lineages in (a) that accurately fitted to the 
observed neuron counts either for mouse (closed gold symbols) or human (closed green 
symbols). Note that the mouse model implicates only lineages 1-3 and the human 
model only lineages 2-5. See Table 1 for observed and predicted neuron counts and 
Table 2 for the proportional contribution of each lineage for mouse and human. 
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life-history that are distinct from species with a GI < 1.5, irrespective of phylogeny. This im- 
plies that there is a considerable constraint on either the ability of species of a given neocortical 
size to exploit certain ecologies or the potential for species of a given ecology to freely adapt 
neocortical size. Furthermore, no species - with the exception of the house cat {Felis catus), 
which may be under unique selection pressures due to its ten-thousand-year-old domestication 
[|28l - falls within the limits of the GI or neuronal threshold range (Fig. 3d). While our re- 
sults countenance previous studies showing associations between physiological and life -history 
traits in mammals (see [|29ll ), we identify those traits to have a bimodal distribution, rather than 
to vary allometrically, across species. This distribution depicts a Waddington-type landscape 
for neocortical expansion - albeit relevant at the species-level wherein the threshold represents 
an adaptive peak requiring a particular adaptation in neurogenic programming within a popula- 
tion for traversal. Our results may explain this landscape by mechanistic differences occurring 
during cortical neurogenesis between species above and below the threshold: the necessity of 
bRG-derived TAPs in high-GI species and their putative absence in low-GI species. The adap- 
tation of bRG-derived TAPs may be tantamount to a relaxation of constraints along lineages 
leading to larger-brained species [|30l . Furthermore, our human model clearly shows that the 
same neurogenic lineages in the same proportions are required to generate the neocortices of 
monkeys, apes, and humans, and may theoretically be extended to carnivores, artiodactlys, and 
other high-GI species, demonstrating that neurogenic period alone may be sufficient to explain 
differences in neocortical size between any species in the same GI group. If differences in neu- 
rogenesis among high-GI species can be largely explained by variation in neurodevelopmental 
timing, we may expect conservation at the genomic level in regions regulating that timing [[311 . 
We propose that bRG-derived TAPs, rather than simply an abundance of bRG in an ex- 
panded SVZ, are necessary and sufficient for the evolution of an expanded and highly folded 
neocortex in mammals. We conclude that the inclusion of TAPs in the basal neurogenic pro- 
gram is the adaptive requirement for traversing an evolutionary threshold. But because we 
reconstruct the eutherian ancestor to have a GI value of 1.48 ± 0.13 (s.e.m.), which falls within 
the range of the observed threshold, we are left with an ambivalent evolutionary history for 
mammalian neocortical expansion: either (i) bRG-derived TAPs are ancestral to all eutherian 
mammals and were selected against along multiple lineages (e.g., rodents, strepsirrhinies), so 
that the ultimate loss of TAPs in certain taxa, and therefore the evolution of low-GI species, is 
the result of divergent developmental adaptations; or (ii) bRG-derived TAPs are not ancestral 
to eutherian mammals, but evolved convergently along multiple lineages, in which case the de- 
velopmental process for their inclusion in neurogenic programming may be conserved, even if 
that process was unexpressed for long stretches of mammalian evolution. While both of these 



11 



histories are speculative, we have nonetheless revealed an important insight into mammalian 
evolution: neocortical expansion beyond a certain threshold requires a specific class of progen- 
itor cell-type; and the difference in neurogenic programming between any species within the 
same GI group does not require novel progenitor-types or adaptations in progenitor-type be- 
havior. Further research into the conservation of genomic regions regulating the expression of 
bRG-derived TAPs, either at the ventricle or through maintenance of a proliferative niche in the 
SVZ, in low- versus high-GI species may be sufficient to determine whether the mechanism for 
neocortical expansion has evolved independently in distantly related species or is the product 
of a deep homology in mammalian neurogenesis. 

Materials and Methods 
Calculating GI 

We calculated GI using images of Nissl-stained coronal sections from http://brainmuseum.org. 
We used 10-22 sections, equally spaced along the anterior-posterior axis of the brain, for each 
species (Fig. SI). The inner and outer counters of the left hemisphere were traced in Fiji 
(http://fiji.se/wiki/index.php/Fiji). The values calculated are marked with an asterisk in Table 
SI. Additional GI values were collected from the literature (Table SI; External Database 1). 
Work in humans and baboons has shown that inter-indvidual variation in GI is not enough to 
outweigh interspecific differences [|32ll33l . 

Stochastic mapping of GI across the mammalian phylogeny 

We used a comprehensive phylogenetic approach to map 41 life-history and physiological char- 
acter traits collected from the literature (Tables S1,S2) onto hypotheses of phylogenetic rela- 
tionships in Mammalia, in order to examine how those traits correlate, over evolutionary time, 
with degree of gyrencephaly. Continuous character traits were discretized using the consen- 
sus of natural distribution breaks calculated with a Jenks-Caspall algorithm Il34l , model-based 
clustering according to the Schwarz criterion [[35l . and hierarchical clustering [[36ll . Character 
histories were then corrected for body mass with a phylogenetic size correction [37J and sum- 
marized across the phylogeny using posterior probabilities. Associations between individual 
states of each character trait along those phylogenetic histories were calculated in SIMMAP 
(vl.5) using empirical priors ll38l ; the association between any two states was a measure of the 
frequency of occurrence (i.e., the amount of branch length across the tree) of those states on 
the phylogeny. The sums, rates, and types of changes for GI and body weight were plotted as 
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mutational maps to assess directional biases in their evolution [I39I - I4TI . The phylogeny used in 
this analysis was derived from a species-level supertree Il42]| . 

Reconstructing the evolutionary history of GI 

Variation in the mode and tempo of a continuous character trait is not always best characterized 
by a random walk (i.e., Brownian motion). Therefore, we compared a range of evolutionary 
models on the phylogenetic distribution of GI to find the best fit for the data [|43l - l46l . Log- 
likelihood scores for each model were tried against the random walk score using the cumulative 
distribution function of the X^ distribution. Maximum-likelihood ancestral character states of 
GI and rate-shifts in the evolution of GI were then constructed using the best-fit model, with the 
standard error and confidence intervals calculated from root node reconstruction in PDAP using 
independent contrasts fl7l - |49ll . To trace evolutionary changes in GI at individual nodes and 
along lineages, we used a two-rate mode that highlighted the differences in high (> 1) versus 
low (< 1) root-to-tip substitutions and then sampled rates based on posterior probabilities across 
the tree using a Monte Carlo Markov Chain. We assumed that transitioning between adjacent GI 
values had the highest likelihood of occurrence. The rate at a given node could then be compared 
to the rate at the subsequent node to determine if a rate transition was likely. We corroborated 
these results using the auteur package [|50l . which calculates rate-transitions at internal nodes 
under the assumption of an Omstein-Uhlenbeck selection model [ISTl over one million Monte 
Carlo sampling iterations drawn from random samplings of posterior distributions of lineage- 
specific rates. Scaling relationships were determined for GI as a function of all continuous life- 
history and physiological traits, including adult cortical neuron counts. For three insectivore 
(Sorexfumeus, Blarina brevicauda, Scalopus aquaticus) species, data were available for neuron 
counts but not GI, and therefore we extrapolated the GI of those species based on their closest 
phylogenetic relatives. Finally, to test whether the bimodal distribution of GI may be influenced 
by the topology of the mammalian phylogenetic tree, we used an expectation-maximization 
algorithm. Each simulated trait was given the same variance as GI (Fig. S5) and the result 
was averaged over 10^ simulated datasets. None of the simulations produced the same bimodal 
distribution of species observed for GI data. 

Estimating neuroepitheUal founder pool populations 

We estimated neuroepithelial founder pool populations for mouse and human. For the mouse, 
we used coronal sections of an El 1.5 mouse embryo obtained from the Allen Brain Atlas [|52l . 
We obtained 19 sections equidistantly spaced along the anterior-posterior axis of the brain. The 
length of the ventricular surface of the dorsal telencephalon was manually traced in Fiji ll53ll 
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on each section starting from the point above the nascent hippocampus and ending in the point 
above the lateral ganglionic eminence. The horizontal length of the embryonic brain at El 1.5 
was measured with images from ||54|| . Using the coronal and horizontal measurements, we 
constructed a polygon representing the ventricular surface of the dorsal telencephalon and cal- 
culated the area of this surface in Fiji. We measured the surface area of the end-feet of neuroep- 
ithelial cells using EM images of the coronally cut apical surface of an El 1.5 embryonic brain 
(Table S5). The diameter of a single cell was calculated by measuring the distance between 
the adherens junctions. We corroborated these end-feet calculations with published immunoflu- 
orescence stainings of the apical complex (ZOl and N-cadherin) from an en face perspective 
[|55ll56l . The average surface area of a single end-foot was calculated by approximating the 
end-foot as a hexagon; and the number of founder cells was estimated by dividing the surface 
of the dorsal telencephalon by the surface of an individual end-foot of the neuroepithelial cell, 
such that 

Surfacearea(iim^) \/3 

— founders (1) 



2n{2Endfootdiameter{iim^) 2 

Our final mouse values were comparable to those previously published [|57l . For the human, we 
followed the same procedure, using 10 coronal sections and one horizontal section of a gestation 
week (GW) 9 brain [l58|. End- feet were calculated using EM images of the apical surface of a 
human brain at GW13. The measurements are available in Table S5. Because the number of 
founder cells per surface area was nearly equivalent in mouse and human ( 4 x 10^/mm^), we 
used this ratio, along with data on ventricular volume collected from the literature (Table S 1 ; 
Table S2; External Database 1), to estimate neuroepithelial founder cell populations for a further 
14 species (Table 1). For species where no data on ventricular volume were available, values 
were estimated based on a regression analysis against brain weight (Fig. S6b). Ventricular 
volume was then converted to surface area for each species by approximating the ventricle as a 
cylinder with a 4.5-to-l height-to-diameter proportion. Ventricular volume-derived ventricular 
surface area estimates were corroborated with the surface areas calculated from the literature for 
mouse and human. Founder cell estimates were then computed based on the densities derived 
above for mouse and human. Using this method, but alternately ignoring our mouse and human 
calculations to define the parameters, we were able to predict mouse and human values within 
10% of our calculations, respectively. 
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Mathematical modeling of neurogenesis 

Workers have demonstrated the occurrence of three primary lineages of neuronal generation in 
mouse neurogenesis [l59l| and a further two lineages in human neurogenesis [6]. While there is 
evidence for at least one additional lineage in mouse ||60l . and further lineages may be spec- 
ulated, we limited our model to the five that are considered to contribute most significantly to 
neuronal output ll6TI - [63ll . The sequence of neuron generation in each of these five lineages was 
summarized in series and solved numerically (Fig. 5a). Neurogenic period was either taken 
from the literature (External Database 1) or estimated based on a regression analysis of neuro- 
genic period as a function of gestation period (Fig. S6c). Neurogenic period in human was esti- 
mated using empirical observations from the literature [|64l - [66l . The averaged cell-cycle length 
for apical and basal progenitors from the mouse (18.5 hours) was used for all non-primates [l67l; 
averaged cell-cycle length for cortical areas 17 and 18 from the macaque (41 hours) was used 
for catarrhines f6S); an intermediary cell-cycle length (30 hours) was used for platyrrhines. Di- 
minishing numbers of neuroepithelial cells have been observed to continue to proliferate at the 
ventricle until E18.5 in the mouse DUl. Therefore, final neuroepithelial founder pool estimates 
were calculated from the aforementioned by evenly decreasing the value of a in the Sherley 
equation [|69ll from 1 at E9.5 to at El 8.5 in the mouse and at comparable neurogenic stages in 
other species. Neuron numbers were calculated for each species from combinations of lineages. 
The proportional contribution of each lineage for each species was parameterized according to 
existing data on progenitor cell-type abundances in mouse [8], marmoset [fT3l . and rabbit [IK 
and WBH, in preparation]. Where no such data were available, proportional contributions were 
permutated for all lineages until a best-fit estimate, based on cortical neuron numbers taken 
from the literature [12211701 - 1721 . was achieved (Tables 1,2). Each lineage was assumed to occur 
from the first to final day of neurogenesis, although this is only approximately accurate. Fi- 
nally, because of published estimates of postnatal apoptosis in the mammalian cortex [1731 - 1751 . 
we assumed neuron counts to be 1 .5-fold higher at the termination of neurogenesis than in the 
adult brain; therefore, neuron number at the termination of neurogenesis was estimated in each 
species by multiplying neuron numbers collected from the literature by 1.5. This multiplication 
is not represented in Table 1 . 

Calculating the effects of transit-amplifying progenitors on neuronal 
output 

Trade-offs in adapting a human lineage combination with either an expanding neuroepithelial 
founder pool or lengthening neurogenic period were tested for the mouse (Mus musculus) and 
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marmoset {Callithrix jacchus), two lissencephalic species whose cell-type proportions during 
neurogenesis have been documented [l8l[T3l[60||. To estimate the relative reproductive value and 
stable-stage proportions of each of the lineages in the mouse and human models, we constructed 
a stage-structured Lefkovitch matrix, using sums of the lineage series (after 100 cycles) as 
fecundity values and complete permutations of the proportional contributions of each hneage 
as mortality values. The altered growth-rates of each lineage were calculated by excluding 
lineages one at a time and assuming 100% survival in the remaining lineages. We introduced 
three ODEs to explore the average dynamics of bRG-derived TAPs, such that: if a(t), b(t), and 
c(t) are the numbers of asymmetrically dividing cells, differentiated cells, and proliferative cells, 
respectively, then, 

at 

db 

—- = ra + 2rc (3) 

at 

f = (l-.)<, + (l-2.)c (4) 

where r is equal to growth-rate. If a(t)=ao, then 

MO = Y^(co + ^ao)(.(i-2'-)^-l)-^r + Z.o (5) 



and 

1-r 



c{t) = (CO + ^ao)e^'-^'-^' - -^ao (6) 

1 — 2r 1 — 2r 



We calculated the effect on neuronal output of increasing the likelihood of bRG-derived TAPs in 
the lineage (Fig. SIO). The interdependent growth-rates in the model reflect a purely mechanis- 
tic interpretation of determining neuronal output from a finite pool of asymmetrically dividing 
cells. The ODEs, therefore, may not reflect differential regulation of neuronal output via direct 
versus indirect neurogenesis. The proliferative cells are designed to carry out one round of pro- 
liferation followed by a final round of self-consumption. 
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Table 1 : Parameters for models of cortical neurogenesis" 



Species 


Gestation 
period (d) 


Neurogenic 
period (d) 


Observed 
Neurons 


Neuroepithelial 

founder pool 

(eel Is) f 


Cell-cycle 
length 
(hours) 


Human 


270 


112 


1.63E+10 


3.10E+07 


41 


Gorilla 


257 


103* 


9.10E+09 


1.59E+07 


41 


Orangutan 


260 


104* 


8.90E+09 


1.16E+07 


41 


Macaque 


166 


60 


1.71E+09 


4.41 E+06 


41 


Baboon 


180 


72* 


2.88E+09 


6.37E+06 


41 


Capuchin 


158 


59* 


1.14E+09 


2.97E+06 


41 


Owl monkey 


138 


55* 


4.42E+08 


1.05E+06 


30 


Calllmico 


153 


60* 


3.57E+08 


6.92E+05 


30 


Marmoset 


146 


58 


2.45E+08 


6.71 E+05 


30 


Galago 


134 


54* 


2.26E+08 


1.01 E+06 


30 


Tree shrew 


46 


19* 


6.04E+07 


5.68E+05 


18.5 


Rabbit 


30 


13 


7.15E+07 


8.08E+05 


18.5 


Agouti 


112 


45* 


l.lOE+08 


9.80E+05 


18.5 


Capybara 


137 


55* 


3.10E+08 


1.78E+06 


18.5 


Rat 


21 


10 


3.10E+07 


5.40E+05 


18.5 


Mouse 


19 


9 


1.37E+07 


3.99E+05 


18.5 



°see External Database 1 

fsee Materials and Methods 

*estimate based on regression against gestation period (see Figure S6) 



Table 2: Best-fit proportional occurrences (%) of lineages in different taxa° 



Taxa 


Lineage 1 


Lineage 2 


Lineage 3 


Lineage 4 


Lineage 5 


Catarrhines 





20 


35 


40 


15 


Capuchin 





20 


35 


40 


15 


Owl monkey 





50 


50 








Callimico 





50 


50 








Marmoset^ 





60 


40 








Galago 





75 


25 








Tree shrew 


10 


75 


15 








Rabbit^ 


10 


75 


15 








Agouti 


10 


75 


15 








Capybara 


10 


75 


15 








Rat 


10 


80 


10 








Mouse^ 


10 


80 


10 









^supported by observational data (see Materials and IVIethods) 
"^see Figure 5a 
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1 cm 

Figure SI: Coronal section of the brain of an adult house cat {Felis catus) (obtained 
from www.brainmuseum.org) illustrating the method used to calculate Gl values as de- 
scribed in [76|. Green line, actual contour; magenta line, hypothetical outer contour. 
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Figure S2: Maximum-likelihood ancestral node reconstruction of Gl values at all internal 
nodes based on a delta (5 = 2.635) selection model. Barplot shows the distribution of 
Gl values across the phylogeny; dashed red line indicates Gl = 1.5. 
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Figure S3: Rate-transitions in the mutation rate of Gl values along lineages of 
the mammalian phylogeny (a) A two-mode selection model that weights low over high 
root-to-tip substitutions. Numbers on the branches indicate the change in mutation- 
rate compared to the previous branch; values indicate no significant change, values 
> indicate significant change (P < 0.05). Note the especially high rate-transitions 
leading to primates, cetartiodactyls, and cetaceans (open blue circles), (b) Mutation- 
and transition-rate estimates of Gl values using an Ornstein-Uhlenbeck selection model. 
Branches are colored to illustrate whether the mutation-rate estimates along each lin- 
eage are above (red) or below (blue) the median rate (orange); nodes are circled to 
indicate the posterior support of a transition-rate-shift event. The gradient of colors 
(see key) indicates the degree of deviation of the mutation-rate estimates (branches) 
and transition-rate estimates (nodes) from the median, with the highest deviation being 
arbitrarily set to ± 1.0 and the median to 0.0; the size of the circles (see key) at the 
nodes indicates the degree of posterior support for a transition-rate-shift event, with the 
highest value being arbitrarily set to 1.0 and lack of support to 0.0. Note that simians 
have evolved Gl values at a rate consistent wi±h the mammalian median. 
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Figure S4: Barplots of types of transitions over mammalian evolution between 
four Gl groups (see Fig. 2a) and between five body mass groups averaged over 
105 simulations. The number of total transitions from one Gl or body mass group to 
another is summed as either high-to-low or low-to-high transitions. Note that signifi- 
cantly more high-to-low than low-to-high transitions are observed for Gl, but that no 
significant difference in type of transition is observed for body mass. 
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Figure S5: The bimodal distribution of Gl values across the phylogeny is non- 
random. A histogram showing the frequency of occurrence of Gl values, binned at 0.05 
intervals, for the 102 mammalian species listed in Table SI. Blue, Gl values < 1.5; red, 
Gl values > 1.5. The bimodal distribution of Gl values shows a natural break at Gl 
= 1.5, which is supported by energy-based hierarchical clustering (see Fig. 2b). Note 
the possibility for a third Gl group (Gl > 3, tomato red), constituting cetaceans and 
elephant; however, we have too few sampled species from these orders to assess the 
group decisively. 
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Figure S6: Ln-transformed plots of neonate brain weight (a) and ventricular 
volume (b) as functions of adult brain weight, neurogenic period as a function 
of gestation period (c); and a plot of neuroepithelial founder cells as a function 
of ventricular surface area (d).(a) Neonate brain weight scales linearly with adult 
brain weight for 52 eutherian species (y = 1.09x - 1.49, R^ = 0.92, P = 6 x 10"'^). 
(b) Ventricular volume scales linearly with adult brain weight for 30 eutherian species 
(y = 0.93x + 2.37, R^ = 0.93, P = 9 x 10"^). (c) Neurogenic period scales linearly 
with gestation period for a sample of six species (y = 0.91x - 0.42, R^ = 0.94, P = 
0.0002), spanning two mammalian superorders. Predicted neurogenic period is shown for 
human, (d) Ventricular surface area, converted from ventricular volume (see Methods), 
scales linearly with our estimated neuroepithelial founder populations (y = 6.7 x 10^ + 
878x, R2 = 0.94, P = 5 X 10"^). (a, c) Note that these plots demonstrate the strong 
predictive powers of adult brain weight and gestation period for neonate brain weight 
and neurogenic period, respectively, validating the assumptions made in Fig. 4. 
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Figure S7: Stacked barplot, for the indicated species, of deviations between the ob- 
served neocortical neuron counts and the ones predicted based on human (red), mouse 
(blue) and marmoset (yellow) lineage combinations (see Table 2 and Fig. 5). For each 
species, deviations were calculated as |100*((Predicted Observed) /Observed)] and then 
divided by the sum of deviations obtained for all three lineage combinations. Note that 
predictions based on the marmoset lineage combination deviate from observed neuron 
counts not only for the 6 species with a Gl value > 1.5 (red text), but also for 8 of the 
10 species with a Gl < 1.5 (blue text), indicating a necessity for differential proportional 
occurrences of bRG in low-GI species. 
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Figure S8: Plot of observed neocortical neuronal count (red circles) as a function of 
neurogenic period for six species with a Gl value > 1.5. Predicted neuron counts are 
presented for the human lineage combination (green circles; see Fig. 5, Table 2) and for 
two further lineages, each of which is assumed to have a 100% proportional occurrence: 
direct neurogenesis from bRG (blue circle) and indirect neurogenesis from bRG via a 
self-consuming IP cell (orange circle). Note that indirect neurogenesis from bRG via IPs 
is nearly sufficient to achieve the observed neuronal count in the Capuchin monkey. 
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Figure S9: Calculating the acJaptiveness of bRG-cJerivecJ TAPs in mouse (a-e) ancJ mar- 
moset (f-j) in achieving 10^ neurons with respect to lengthening neurogenic period ancj 
expancjing neuroepithelial founcder pool size. The folcd-change of lengthening neurogenic 
period or expanding neuroepithelial founder pool size is indicated in each relevant plot, 
(a) The observed neurogenic period and founder pool size in mouse generates 1.37 x 
10^ neurons using the mouse lineage combination, (b, c) Lengthening the neurogenic 
period (b) or expanding the founder pool size (c) using the mouse lineage combination 
to achieve 10^ neurons, (d, e) Lengthening the neurogenic period (d) or expanding the 
founder pool size (e) using the human lineage combination to achieve 10^ neurons, (f) 
The observed neurogenic period and founder pool size in marmoset generates 2.45 x 10^ 
neurons using the marmoset lineage combination, (g, h) Lengthening the neurogenic 
period (g) or expanding the founder pool size (h) using the marmoset lineage combina- 
tion to achieve 10^ neurons, (i, j) Lengtheninfi-the neurogenic period (i) or expanding 
the founder pool size (j) using the human lineage combination to achieve 10^ neurons. 
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Figure SIO: Neuronal outputs from solutions to ODEs describing direct versus 
indirect neurogenesis for growth-rate values < 0.5. Contour plot of neuronal densities 
for a varying initial asymmetrically dividing cell population (a) and likelihood of direct 
(r = 1) versus indirect (r = 0) neurogenesis. Note that neuronal output increases 
maximally when both the initial cell pool increases [a — )■ 100) and the likelihood of 
indirect neurogenesis increases (r — ^ 0). 
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